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Abstract 

The comparison of various electronic-optical vision systems has been conducted based on the criterion ultimate 
information capacity, C, limited by fluctuations of the flux of quanta. The information capacity of daylight, night, and 
thermal vision systems is determined first of all by the number of picture elements, M, in the optical system. Each element, 
under a sufficient level of irradiation, can transfer about one byte of information for the standard frame time and so C » M 
bytes per frame. The value of the proportionality factor, one byte per picture element, is referred to systems of daylight and 
thermal vision, in which a photocharge in a unit cell of the imager is limited by storage capacity, and in general it varies 
within a small interval of 0.5 byte per frame for night vision systems to 2 bytes per frame for ideal thermal imagers. The 
ultimate specific information capacity, C”\ of electronic vision systems under low irradiation levels rises with increasing 
density of optical channels until the number of the irradiance gradations that can be distinguished becomes less than two in 
each channel. In this case, the maximum value of C * turns out to be proportional to the flux of quanta coming from an 
object under observation. Under a high level of irradiation, C* is limited by diffraction effects and amounts to 1/A 2 
bytes/cm 2 frame. 
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1. Introduction 

The design of electronic-vision and other systems 
involves the optimisation of many tactical and tech¬ 
nical parameters. Optimization of the information 
capacity — the most important parameter of elec¬ 
tronic-vision systems according to Rose [1] — leads 
to very different considerations and results from the 
standard ones [2], We will calculate the ultimate 
information capacity of electronic-vision systems 
limited by photon fluctuations, find the diminishing 
factors in this and show the ways to reduce these 
losses. 


Corresponding author. 


2. Expressions for calculation of information ca¬ 
pacity of electronic vision systems 


According to Rose’s concept of the ultimate reso¬ 
lution limited by photon fluctuations [1] and Shan¬ 
non’s definition of the information capacity [3], the 
authors of Ref. [4] used the following relationship for 
calculation of the ultimate information capacity per 
frame, C, of electronic-vision systems, expressed in 
bytes: 



( 1 ) 
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The notation introduced and necessary comments 
on them will now be described: 

(1) M is the number of the picture elements in the 
focal plane image of an object. It should not be 
confused with the number of photosensitive unit 
cells (photosensitive elements) in a photodetector 
device. The number of these unit cells will be de¬ 
noted by M e hereafter. Naturally, one and the same 
number of the picture elements in a frame can be 
obtained using a staring two-dimensional (2D) array 
photodetector as well as a scanned linear-array or 
even single-element detector device. In the last cases, 
the number of cells, M e > 1, is considerably less 
than the number of picture elements, M. So for 
example, a square frame image consisting of M = m 
X m picture elements can be examined either with 
the use of a 2D array detector with the same number 
of unit cells, M e = M = m 2 , or by one-dimensional 
scanning of a linear array detector with Af e = m. In 
case a 2D array detector and a linear one having an 
equal numbers of cells, M e , are used, the former 
decomposes the image into M = M e elements and 
the latter decomposes it into a much greater number 
of elements, in particular, into M = A/ e 2 elements if 
the scanning direction is perpendicular to the linear 
array and the frame is square. These typical exam¬ 
ples will be analyzed below while maximizing the C 
value. It is not difficult to obtain the relationship 
between M and M e for other cases, particularly, for 
linear multi-row detector arrays or SPRITE devices. 

(2) k 0 is the minimum signal-to-noise ratio in a 
unit cell at which the permissible errors in signal 
measurement (the errors of non-detection and false 
alarm) are ensured [1]. 

(3) (3 is the angle of the projection of the objec¬ 
tive or the plane angle at which a photosensitive unit 
cell ‘sees’ the exit pupil of the objective. According 
to this, p/2 is the back aperture angle of the objec¬ 
tive. If the distance from an observed object is much 
greater than the linear dimensions of the objective, 
then the image is projected practically in the focal 
plane and the sine of the said angle is determined by 
the diameter of the exit pupil, D', and the focus 
distance, /: 


P D'/2f 

sin— = . . . 

2 V1 + ( D'/2f) 2 


( 2 ) 


Often, the approximation sin( p/2) ~ D/2f is 
used, where D is the diameter of the entrance pupil 
of the objective. 

(4) t) is the effective quantum yield of a unit cell 
taking into account the losses in a photosensitive 
element and the ratio of the element area to the area 
of a picture element (additional losses arise if the 
element area is less than that of a picture element). 
In general, the losses in the optical and scanning 
systems, etc. can be taken into account with the use 
of the factor rj as well. 

(5) F is the noise factor. When the radiant noise 
is dominant (and this case of BLIP detection [1] will 
be considered here), F changes within a small inter¬ 
val depending on the spectral range of a photosensi¬ 
tive element [5,6]. A value of F = 1 corresponds to 
the short-wavelength radiation (the energy of the 
photon, hv, is greater than the thermal energy kT), 
for which the noise is the Poisson’s one. F= 1.37 
corresponds to the total radiation of a blackbody 
source in the whole spectral range of 0 to °° and is 
conditioned by extra-Poisson noise under grouping 
of the photons with the low energy values, hv<zkT. 

(6) A e is the area of a unit cell (a photosensitive 
element), determining the area of a picture element 
in the focal plane as well. With the total number of 
picture elements equal to M, the frame area, A f , i.e., 
the area of the image in the focal plane, is: 

A f = MA t . (3) 

The area of a picture element in the two cases 
considered above is related to the area of the frame 
by: 



in a 2D array, and by 



in a linear array. 

(7) t j is the time for integration of a signal (a 
charge) for each picture element of the image. In 
electronic vision systems, the frame time, T f , is 
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usually set at the value which is intended for the 
analysis of a given image. In a 2D array, with 
M = M e , a signal from each picture element is formed 
in its individual cell, which is why this signal can be 
integrated practically for the whole frame time and 
t t ~ T f . With M e < M, particularly for a linear array, 
for the intended time T f , one unit cell should exam¬ 
ine m = M/M e picture elements, therefore, the inte¬ 
gration time cannot be more than 


k 0 n n = k 0 ^Frjn e , i.e., the number of the levels 
(gradations) of signal that can be distinguished are 



= -pin(/3/2 )^jA et[ N e . (8) 



( 8 ) N e is the maximum (for a given system and 
conditions of observation) quantity of the effective 
photons emitted and reflected by a unit of object area 
per time unit. The term ‘effective’ signifies that the 
value N e is determined for the limited spectral range, 
A s -A m , of a photosensitive element, where A s and 
A m are the short and the long cutoff wavelengths of 
the element respectively. The total number of quanta 
in the whole spectrum of the object radiation will be 
denoted by N. Naturally, N C = N only for the unlim¬ 
ited spectral range of a sensitive element (A from 0 
to *). 

The structure of Eq. (1) is obvious. If the density 
of effective quanta emitted by an object is N e (in the 
range A s -A m ), then in the focal plane and without 
taking into consideration the losses in the objective, 
it amounts to A e sin 2 ( /3/2): the object radiation is as 
usually assumed to follow the Lambert’s law [2]. So, 
the term sin 2 ( /3/2) is the coefficient of collection 
(transference) for an ideal objective. For the time t„ 
n e = t i A e A' e sin 2 ( /3/2) quanta get to the area A e and 
the photosensitive element registers only the fraction 
17 . And just this quantity, n s — t) n e , is a signal. The 
intrinsic (radiant) noise of the nu mber of quanta 
obtained is equal to n n = y , Frjn e . The system is 
capable of distinguishing (with permissible errors) 
the increment in signal that is k 0 times the noise. 


According to Shannon, just the binary logarithm 
of this number is a measure of the information 
capacity of a unit cell expressed in bits, i.e., it 
amounts to the number of bits, b, that is required to 
express the number K in the binary system: K~2 b . 
Hence, the number of bytes equals b/ 8 = 
(log 2 F)/ 8 . Each picture element is an individual 
(independent) optical channel of information trans¬ 
ference in fact. The information capacity of a multi¬ 
channel optical system is determined by the sum of 
those of M channels hence it is equal to 
(Aflog 2 AO /8 bytes. So, we obtain from Eq. ( 8 ) Eq. 
( 1 ). 

Note that the number K can also be considered as 
the maximum signal-to-noise ratio for a picture ele¬ 
ment (the dynamic range of a picture element) nor¬ 
malized to the required ratio k 0 . 

The expression for C from Eq. (1) presented 
above characterizes capabilities of a vision system in 
information capacity. When deriving, the noise was 
supposed to be limited by the fluctuations of the flux 
n e of photons which cannot be removed in principle 
and the density of emission from each element of an 
object to take any value from zero to N e . As nominal 
values of the N e , we will choose its statistical values 
for each electronic vision system (Section 3). Just the 
consideration of the described model is first of all 
the subject of the present paper. Naturally, images of 
real objects are determined by features of these 
objects and conditions of observation. The effect of 
some factors on the volume of the information trans¬ 
ferred via the image is discussed in Section 8 . 

The dependence of Eq. (1) of the information 
capacity C on all the parameters of the problem is so 
obvious that the traditional order of setting forth can 
be violated and a number of conclusions can be 
formulated at once. 
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The value C being proportional to M, in order to 
increase C, the number of picture elements, M (and 
just the picture elements, but not the unit cells in the 
photodetector device) should be increased first of all. 
The rest of the parameters of the system [ 17 , t„ N e , 
sin( /3/2), F] enter into the relationship for C under 
the logarithm symbol and so they influence C far 
weaker. Therefore, one can set no strict requirements 
to the quantum yield and use not only 2D array 
detectors but linear ones as well. The principal dif¬ 
ference of a linear array detector from the 2D one 
concerning signal processing consists in decrease of 
?i, and this time, as we noted just above, weakly 
influences the value C. A linear array can be even 
more preferable than the 2D array: if the use of it 
creates great possibilities for additional increase of 
the number of picture elements as in the example 
considered above. 

Since the information capacity C weakly depends 
on N e , it also changes little on variations of the 
spectral range, A s -A m , the total number of photons, 
N, and even the source of radiation (up to reasonable 
limits) if only the numbers of picture elements, M, 
in the vision systems compared are equal to each 
other. 

Let us corroborate the conclusions formulated 
above with concrete numerical calculations. In order 
to do this, it is convenient to modify Eq. (1) for the 
capacity C and to separate the terms in this relation¬ 
ship determining the value C for an ideal photode¬ 
tector device from the terms expressing losses of C 
in real devices. An ideal device should register all 
the quanta incident upon it and bring no additional 
noise, then the normalized signal-to-noise ratio K 
and, consequently, the value C ~ log K will be 
maximum. These conditions are valid with: 

• the quantum yield to be 17 = 1 ; 

• the maximum integration time, t t = T { , i.e., with 
the use of a 2D array with M e = M\ 

• the spectral range of the photosensitive element to 
be infinitely wide, i.e., extended from 0 to °°, then 
N c = N; 

• BLIP detection to take place. 

Let us express log K in Eq. (1) in terms of the 
said parameters, T f , N: 

C = 0.415A/ |log[—sinj 



1 T f N 1 F 0 \ 

“I ,og ^r + I 108 “'' 


( 9 ) 


The meaning of the proposed relationship, Eq. 
(9), is clear. In an ideal photodetector device, 
T f N/(,r l t i N e )= 1, F 0 /F = 1, which is why the two 
last logarithms turn into zero and so the first loga¬ 
rithm determines the maximum value of C. In a real 
photodetector device T f N/(t)t l N e ) > 1, therefore, the 
second logarithm determines the level of losses in 
the value C. The maximum value of the third term, 

1 F 0 1 

max—log— = —log 1.37 = 0.07, (10) 

is much less than the values of the first two ones (the 
following evaluations give 3.0-4.5 for them) and so 
the last logarithm can be neglected hereafter. 

Let us start the analysis from the comparison of 
the C values for diverse electronic vision systems. 


3. Information capacity of daylight, night, and 
thermal vision systems 

3.1. Daylight vision systems 

In daylight vision systems, the image of an object 
is formed by the direct and diffused radiation of the 
sun reflected from this object. In Ref. [4], a tech¬ 
nique is described for the calculation of the flux 
density, N , of the quanta reflected from vertical 
objects situated near the surface of the earth on its 
illuminated hemisphere. 

The technique is based on the statistical data of 
Refs. [7,8] on the radiant balance of the earth. The 
radiation of the sun was calculated in Ref. [4] as a 
blackbody source at a temperature of T s = 5785 K. 
Thereafter, the density of quanta at the sunlit bound¬ 
ary of the atmosphere was found. The earth surface 
is reached by, on the average, 0.3 of the sun radia¬ 
tion in the form of directly incident sun radiation and 
additionally 0.18 of this radiation is in the form of 
atmosphere-diffused radiation [7,8], If the fact is 
taken into account that the area of sunlit earth hemi¬ 
sphere is twice as large as the cross section area 
from which the sun radiation is collected for the 
earth and the mean value of the daylight irradiance 
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for vertical objects is about 1.5 more than for hori¬ 
zontal ones [4], then we obtain 


( 11 ) 


Thus, the flux of quanta near the surface of the 
earth turns out to be less than that on the sunlit 
boundary of the atmosphere: by a factor of 2.78 for 
vertical objects and averaged day conditions. With 
the maximum value of reflectance, R = 1, such a 
technique leads to the value 
N=2.29x 10 ' 7 cm -2 s" 1 . (12) 

For subsequent evaluations, it is reasonable to 
take T f to be equal to the time of the TV frame, 
4 X 10 ~ 2 s. The area of an element (unit cell) typical 
of multielement photodetector devices is A e = 30 X 
30 ^m 2 = 9 X 10 -6 cm 2 . Let us take a conditional 
objective with the relative opening hole of D:f= 1:1, 
then sin( /3/2)» 0.45. The required value of k 0 = 5 
[1], Substituting the assumed numerical values of the 
parameters for the system into Eq. (9), we obtain: 

/ 1 T f N 

C = 0.415 M 4.34 - -log-— 

\ 2 T\t i N t 

I T f N \ 

= 1.8 M 1 -0.115 -log-— . (13) 

\ -nhK I 


3.2. Night vision systems 

For night vision systems, evaluation of the infor¬ 
mation capacity, C, was based on the data of Ref. 
[9], in which the irradiance of the earth is from the 
stars in the absence of the moon and clouds. From 
this the maximum density of quanta can be calcu¬ 
lated: it turns out to be equal to 
N» 10" cm -2 s~'. (14) 

At this value of N, the expression for the informa¬ 
tion capacity C in Eq. (11) is: 

/ T f N \ 

C = 0.51 Af 1 — 0.41og- . (15) 

\ Vh N t) 

3.3. Thermal vision systems 

For the calculation of C, let us assume that the 
maximum value of N corresponds to the thermal 


radiation of a blackbody source at the temperature 
equal to the averaged temperature of the earth ac¬ 
cording to the thermal radiation T E = 293 K [4], 
According to the Stefan-Boltzmann law for a flux of 
particles [ 10 ]: 

N=o-Ji = \.52X 10“(293) 3 

= 3.82X 10 18 cm -2 s -1 . (16) 

Substituting this value into Eq. (9), we obtain: 

[ TfN \ 

C=2M 1-0.1 log- . (17) 

\ VtiK) 

Certainly, the values of the parameters A e , 7}, 
sin( /3/2), k 0 can change from one system to another 
and differ from the nominal values assumed above, 
therefore, the numerical factors in Eqs. (13), (15), 
and (17) are not a dogma yet, but they are near to a 
dogma! Firstly, variations of the said parameters are 
insignificant as compared with the variation of N 
[usually, A t » 10 x 10-50 X 50 fim 2 , the relative 
aperture is 1:1—1:2, and so sin( /3/2) ® 0.45-0.24]. 
Secondly, a logarithmic dependence smoothes varia¬ 
tions of these parameters. For the typical limits of 
variations of the parameters, the variation of the 
numerical factors in the relationships for C of day¬ 
light and thermal vision systems, Eqs. (13), (17), 
falls inside the limits of ±( 2 - 8 . 6 )% and that in the 
relationship for a night vision system, Eq. (15), lies 
within an interval of ±(9-30)%. 

The relationships (13), (15), and (17) make it 
possible to evaluate the information capacity of the 
three systems. The value C reaches its maximum 
value, naturally, with the use of both an ideal objec¬ 
tive and an ideal 2D array detector accommodated 
with a source of radiation in the spectral range and is 
determined by the first co-multiplier in the relation¬ 
ships given above. 

The greatest flux N of quanta takes place in 
thermal vision and the information capacity of this 
system is therefore theoretically the highest, all other 
conditions being equal. From Eq. (17), a handy rule 
follows: the ultimate information capacity of one 
frame in an ideal thermal imager equals double the 
number of picture elements: 

C = 2 Mbytes. (18) 

In daylight vision, the value N decreases by more 
than an order of magnitude, but since C weakly 
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depends on N (C ~ log AO, a decrease of the capac¬ 
ity C itself does not exceed 10% [compare Eqs. (13) 
and (17)]. 

A larger decrease of C takes place in night vision 
systems: it is one fourth as high. But if the fact is 
taken into account that the flux of quanta, N, de¬ 
creases by l\ orders of magnitude here, then such a 
decrease of C does not really seem to be excessive. 

As can be seen, even under a very low flux of 
quanta (ca. 10 " cm -2 s _1 ), the potential of the 
image field turns out to be sufficient for the opera¬ 
tion of an electronic vision system (night vision). 
And here, it is natural to put the question: what is the 
minimum flux of quanta, N min , with which the image 
will be still formed in the electronic vision systems 
described that is sufficient for the transfer of infor¬ 
mation? 

Let only two logical states, ‘zero’ and ‘unity’, be 
transferred through each optical channel, which cor¬ 
responds to two gradations in the image, i.e., K 
appears to take the minimum possible value, K min = 
2. It is evident that such an electronic vision system 
is capable of transferring brightnesses of grey sur¬ 
faces as well on account of the fluctuation of the 
density of signals ‘zero’ and ‘one’. With the typical 
system parameters assumed including A e = 9 X 10 “ 6 
cm 2 and with an ideal photodetector device ( 17 = 1 , 
r, = T f ), we obtain: 


sin( 0 / 2 ) ,_ 

K min = .--- ■ - ]jA e T f N min = 2, 


N min = 


4 k 2 0 

A e 7 f sin 2 ( 0 / 2 ) 


1.4X 10 9 cm 2 s '. 


(19) 


This evaluation is significant: it permits the infor¬ 
mation ‘reserve of robustness’ of a vision system to 
be determined. Daylight and thermal vision systems 
possess a firm ‘reserve of robustness’: the ratios 
N/N min are 1.6 X 10 8 and 2.7 X 10 9 , respectively. 
Therefore, some losses, and they can be appreciable, 
are permissible when registering such a great amount 
of quanta, N, which we will see below. In a night 
vision system, the ‘reserve of robustness’ is practi¬ 
cally exhausted: the excess of N over N mi „ is not 
even two orders of magnitude. There is nowhere to 
‘retreat’ in this case: no appreciable decreases in N 


are permissible here. It follows from the above state¬ 
ment that for the daylight and thermal vision systems 
an photosensitive element area of ca. 30 fi m can be 
called high-level and for night-vision systems as 
low-level. 

A conditional boundary between high and low 
levels, N 0 , can be introduced: in particular, it can be 
taken that at a high level ( N>N 0 ), one channel is 
capable of transferring not less than a byte of infor¬ 
mation. Then, 


2 16 kl 


A e Tf sin( 0 / 2 ) 


( 20 ) 


The priorities in C, namely, thermal vision, a 
daylight vision system, and a night vision system, 
are valid with the use of not only ideal photodetector 
devices but real ones as well. So with one and the 
same value of T ( N/(rft { N e ) = 10, the value C turns 
out to be in these systems 10, 11.5, and 40%, 
respectively [compare Eqs. (17), (13), and (15)]. 

Let us consider successively the main causes that 
lead to an increase of C in real systems. 


4. Spectral range and information capacity 

It is obvious that the spectral range of a photode¬ 
tector device must be accommodated with the spec¬ 
trum of radiation of the sources: the sun ( 0 . 2-2 
/tm), the stars (0.5-1.9 /xm), or the thermal radia¬ 
tion of the earth (3-40 /im). Under real conditions, 
the spectral range of a photodetector device is lim¬ 
ited, which is why only a part of quanta, N e , of the 
total radiation flux N turns out to be registered. Let 
a photodetector device be an ideal one concerning 
the rest of the parameters: the quantum yield 17 = 1 , 
the integration occurs in the course of the whole 
frame time, t, = T f . 

The technique for the calculation of the capacity 
C is obvious in this case. Integrating the Planck’s 
distribution in the interval A s -A m (Table 1), we find 
the N e , calculate the value log N/N e , and then the 
coefficient of the use of information capacity, C(A S - 
A m )/C(0-°°), i.e., the term in the parentheses of 
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Eqs. (13), (15), and (17). The values of N e and 
C(A s -A m )/C(0-°c) obtained with the use of such a 
technique are given in Table 1. There typical spectral 
ranges are for electronic systems of daylight vision 
0.45-0.95 gm (silicon), for thermal vision it is 3-5 
lim (InSb, PtSi, HgCdTe) and 8-14 gim (HgCdTe). 
In the examples under consideration, a sensitive 
element registers one third-one four hundredth frac¬ 
tion of the entire amount of quanta only, but owing 
to a ‘logarithmic influence’, the information capacity 
decreases only by 6 % to 26% in these cases. One 
more result unusual for developers of thermal im¬ 
agers follows from Table 1: the photodetector de¬ 
vices intended for the ranges 8-14 and 3-5 fi m are 
identical regarding the value C: the difference does 
not exceed 20 %. 


5. Dependence of information capacity on quan¬ 
tum yield, number of channels, and configuration 
of two-dimensional array 

Similarly, one can evaluate the losses in C, result¬ 
ing from the rest of the factors in Eq. (1). 

The quantum efficiency of intrinsic photoconduc- 
tive and photodiode detectors in the spectral range 
usually lies within the limits of tj = 0.3 to 0.9 and 
with such values of 17 , the coefficient to use for C 
for daylight and thermal vision systems is equal, 
according to Eqs. (13), (17), 

frf ■ 1 - °' |-O ' ii5iog o5^9 - o - 94 - 0 '" 5 - 

( 21 ) 

Thus, the question: how much the quantum effi¬ 
ciency of the detectors in these systems influences C 
can be answered: in fact not at all. 


Decrease of C is more appreciable with very 
small values of 77 , in particular in the thermal im¬ 
agers based on Schottky-barrier detectors. In such 
detectors, values of 17 = 10“ 2 are possible [ 6 , 11 ]. 
According to Eqs. (13) and (17), the coefficient 
becomes 0.77-0.8 in this case. 

Similar losses in C are observed in night vision 
systems, even if the photodetector devices with a 
relatively high value of 17 = 0.3-0.9 are used in 
them: 


c(v) 

C(l) 


( 22 ) 


Through the dependence C(f,), the influence of 
the number of picture elements (optical channels) 
and the detector configuration on the information 
capacity can be observed. It was noted above that for 
an ideal 2D array detector, t, = T f and for an ideal 
linear array detector, the ultimate possible integration 
time is t t = T ( /M t , i.e., it is determined by Eq. (7). 
The latter relationship was written for the simplest 
model of a square frame, M = M e X M e . Taking 
these values into account and according to Eqs. (13) 
and (17), we obtain for systems of daylight and 
thermal vision (for one and the same numbers of 
picture elements, M , in linear and 2D arrays): 


C(lin. array) 
C( 2D array) 


T f 

- (0.1-0.115) log— 


= 1 - (0.1-0.115) log A/ 

= 0.69-0.8 (23) 

The numerical comparison in Eq. (23) was con¬ 
ducted for linear arrays with a number of unit cells 
of 100 to 500 and for 2D arrays of 100 X 100 to 
500 X 500. Even if a developer replaces such a linear 
array with the 2D one (i.e., increases the number of 


Table 1 

Losses in information capacity under the limitation of spectral range 



System 







Daylight vis 

ion 


Night vision 

Thermal vision 



A s -A m ( /xm) 

0-se 

0.45—0.95 

The eye 

0-°c 

0-=c 

8-14 

3-5 

/V, N t (cm - 2 s“ ') 

2.3 X 10 17 

7.6 X 10‘ 6 

2.3 X 10 16 

10" 

3.8 X 10 18 

8.5 X 10 17 

10 16 

C (0,k) (bytes/frame) 

1.8 M 

- 

- 

0.5 Af 

2M 

_ 

- 

C(A s -A m )/C(0,=c) 

1 

0.94 

- 

' 

1 

0.94 

0.74 
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channels by a factor 100 to 500), then a gain by only 
20% to 31% will be obtained! It is unlikely that such 
efforts can be considered as reasonable ones. Thus, 
we have come to the next non-trivial result: when 
designing daylight and thermal vision systems with a 
criterion of information capacity, the perfect 
‘emancipation’ of the linear array occurs. 

However, a linear array is unfit for image transfer 
in night vision systems because there results too 
short a dwell time, it is M ~ 1 as great as that in the 
2D array [see Eq. (7)], intended for integration of a 
weak signal from each picture element. This is 
equivalent to the reduction of the average (over the 
frame time) flux of quanta by the same factor of M e : 




N 

~M~ e 


10 " 

100-500 


(0.2-1) X 10 9 cm -2 s '. 

(24) 


Here, N= 10" cm -2 s -1 [from_Eq. (14)], Af e = 
100-500. The calculated value of N turned out to be 
less than the permissible minimum. N min , from Eq. 
(19): for the linear array, the ‘reserve of robustness’ 
turned out to be insufficient! 

Hitherto, photodetector devices with the identical 
photosensitive areas A e have been compared. An¬ 
other model that is close to the real situation may be 
considered. Let a transfer of the image of an as¬ 
signed area A f , be required, and the system of 
switching followed by a signal-processing system 
limits the number of unit cells (photosensitive ele¬ 
ments) to some value, M e . Then a developer has two 
main options to construct the system. The first is to 
occupy the whole frame area, A f , with a 2D array of 
the number of unit cells equal to M e . In this case, the 
number of picture elements is limited by the ultimate 
possible number of unit cells, M = M t , with A e 
being equal to A f /A/ e [see Eq. (4)] and t■, = T f . The 
second option is: to scan the A f area with the linear 
array having the same number of unit cells, M e . This 
makes it possible to increase the number of picture 
elements considerably: for a square frame, M = M e 
X M c . However, the element area and the integration 
time decrease: A e = A f /M e 2 [Eq. (5)], t, = T s /M e 
[Eq. (7)]. As can be seen, in such a model, the 2D 
array cannot compete with the linear one at all: it 
yields to the latter in the number Af by a factor of 
M e and consequently, in the information capacity 


almost by the same factor of M e . ‘Almost’: because 
the values A e and t x in the 2D array are, however, 
higher. A precise calculation of the values C for 
systems based on the photodetector devices under 
consideration can be conducted with the use of Eq. 
(l). Substituting the said values of M , A e , r ( into 
this equation, we will obtain: for a vision system 
based on the 2D array: 

C {2D) = 0.415 M e log|-^-sin( /3/2)y/^-A f r f N e 
= 0.415 M e |log|^-sin( p/2)faA ( T { N e 

-ilogM e j (25) 

for that based on the linear array: 

Qn.array = 0-415 M 2 log (-i- S in( (3/2)faA f T f N e 

= 0.415 M e 2 jlog | y sin( /3/2) faA f T,N e 

— flog Af e J. (26) 

The ratio of these values is equal to: 

C (2D> w 1- (0.19-0.23) log M e 
C (lin.array) _ ^ 1 - (0.064-0.076)lOg M e 
*= A/ e (l — (0.13-0.15) log A/ c ) 

= (0.5-0.65) Af e . (27) 

The argument of the logarithm has been calcu¬ 
lated, naturally, for high-level systems: N e = 10 16 - 
3.82 X 10 18 at r) = 1. The frame area was taken as 
A f = 2 X 2 cm 2 , and the values T f and sin( j8/2) 
have not been changed. From Eq. (27), a value of the 
above-noted ‘almost’ is seen: with M e = 500-2000, 
the linear array surpasses the 2D one in information 
capacity if not by a factor of A/ e then by Af e /2. 


6. Electric capacity of charge storage 

. In all the cases analyzed above, finally, the amount 
of the effective quanta detected by a photosensitive 
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element, n s = T/A e fj/V e sin 2 ( p/2), was varied. The 
density of these quanta per unit of the element area 
in the systems of daylight and thermal vision consid¬ 
ered above changes within the limits 


4x0 

= 0.202X0.3-10 16 

500 

«5xl0'°cm“ 2 , (28) 



= 0.202 X 4.10~ 2 X 3.82 X 10 18 
*3x 10 16 cm" 2 . (29) 

The minimum density of quanta, Eq. (28), was 
obtained, in the present paper, for a linear array of 
photosensitive elements with the spectral range 3-5 
jum and for the frame format 500 X 500. The maxi¬ 
mum density of quanta, Eq. (29), was obtained for 
an idealized 2D array with the infinitely wide spec¬ 
tral range (N e = N). It was supposed that the pho¬ 
todetector device was capable of registering all the 
effective quanta, n s , incident upon a photosensitive 
element, therefore, it was n s that was used in the 
calculation of C with Eq. (1). 

Up-to-date photon focal-plane array devices in¬ 
clude charge storage cells. Are they able to accumu¬ 
late the charge corresponding to the said amount of 
quanta n s ? In order to answer this question, let us 
evaluate the maximum density of electrons that can 
be accumulated in the storage capacitor of a semi¬ 
conductor CCD or MOS unit cell: 


n ’ 


c" U _ 2 X 10 - 8 (1-10) 
q 1.6 X 10 ~ 19 


10 n - 10 12 cm -2 . 


(30) 


Here, q= 1.6 X 10 " 19 C is the electron charge. 
In order to simplify the calculation, the areas of a 
storage capacitor, a picture element, and a photosen¬ 
sitive unit cell are assumed to be equal. The specific 
value of the capacitance equal to c* = 2 X 10“ 8 
f/cm 2 is typical for a CCD, in a 2D array with MOS 
multiplexers it can be somewhat higher [12]. The 
taken interval of voltage variation, U = 1-10 V, is 
also typical for CCD and MOS 2D arrays. 


Comparison of Eqs. (28), (29), and (30), shows 
that the ultimate charge registered in a photodetector 
device is limited by the capacitance of a storage cell: 
only in a linear array with range 3-5 /im, it can be 
sufficient for the accumulation of the necessary 
charge. And in a 2D array at a high flux of quanta, 
n e , special measures have to be taken (in particular, 
division and skimming of charge, external frame- 
after-frame accumulation) in order to ensure the 
required dynamic range. Exactly because of the said 
limitation, in IR 2D arrays based on HgCgTe (8-14 
/un), one is still limited by a row integration (t i = 
Tf/M'). Note that in IR 2D CCD based on Schottky 
barriers for the range 3-5 /rm, n s decreases not only 
through a shift of spectral range, but also because of 
a small quantum yield (17 = 0 . 01 ). 

In previous sections, we noted that the informa¬ 
tion capacity of high-level vision systems can be 
influenced, although this influence is weak, by a 
number of factors: the type of a source (the sun, 
thermal radiation), the spectral range and quantum 
yield of a photosensitive element, the configuration 
of photosensitive elements (a linear or 2D array). But 
even a weak influence of these factors on C can be 
revealed only after the increase of the capacitance of 
a storage cell by many orders of magnitude: up to 
10 l 6 -10 17 electrons/cm 2 [Eq. (29)]. However, at 
present, the limitation in principle in the information 
capacity of high-level systems with focal-plane pho¬ 
ton detectors is set by the electric capacitance of a 
storage cell. This capacitance smooths the distinc¬ 
tions in the parameter C among vision systems and 
completely removes the influence of the factors men¬ 
tioned above: spectral range, level of sensitivity, and 
configuration, naturally, in real limits of their varia¬ 
tion. 

Thus, in all the considered systems with a high 
level of irradiation (n s > A t n ’), a unit cell of the 
photodetector device is able to accumulate a quantity 
of electrons equal to A e n" only. Therefore, in 
the initial relationship for C in Eq. (1), the num¬ 
ber of quanta detected by a unit cell, n s = 
■qt i A e N e sin 2 (/3/2), should be substituted by the 
quantity of accumulated electrons, A e n‘. This re¬ 
sults in 

C = 0.415 M logf ^A e n* 1 » M. 


(31) 
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The numerical evaluation was conducted for the 
above-assumed values of k 0 , A e , and also the mean 
value n * = 3.10 11 cm -2 [Eq. (30)]. 

It follows from Eq. (31) that with the limitation of 
a photocharge from each photosensitive element by 
the electric capacitance of a real storage cell, no 
more than one byte of information can be transferred 
through one optical channel for the frame time. This 
value was taken above as a conditional boundary N 0 
between low and high-level electronic vision systems 
[see Eq. (20)]. It is easy to find that n* = 
N 0 T { sin 2 (j3/2). 

In the very beginning of this paper, just after 
having written the relationship for C in Eq. (1), we 
saw its principal feature: C is proportional to M and 
the rest of the parameters weakly influence the value 
C. The further analysis has only defined the propor¬ 
tionality factor, i.e., the number of bytes per picture 
element more precisely. For typical parameters of 
vision systems, the information capacity of one ele¬ 
ment does not exceed 2 bytes (thermal vision, an 
ideal photodetector device [Eq. (18)]). The daylight 
and thermal vision systems with real photodetector 
devices yield in C to the systems based on ideal 
detector devices by about a factor two (for the 
identical values of A e and M) and with the format 
of 500 X 500 = 2.5 X 10 5 , they are able to transfer 
the quantity of information per frame equal to 
C = 2.5 X 10 5 bytes. (32) 

Accordingly, in high-resolution vision systems, 
M~ 1000 X 1000 elements per frame, the capacity 
C reaches values on the order of 10 6 bytes. It is 
pertinent here to repeat the conclusion: it is unrea¬ 
sonable to waste efforts for improvement of the unit 
cell to enhance information capacity only: even if 
increasing c* by a factor of 10 6 , we increase C only 
by a factor of two, being the theoretical limit. Thus, 
the main way to increase C consists in an enlarge¬ 
ment of the format. 

The night vision system is different. In this low- 
level system, the flux of quanta is small even for the 
whole frame time (t i = T f ), 

■j-=N e T f sin 2 (/3/2) = 10" X4X 10~ 2 X 0.202 

= 8 X 10 8 cm -2 (33) 

therefore, the capacitor of a unit cell is able to 
accumulate such a number of electrons [Eq. (30)]. 


Thus, the conditional division of electronic vision 
systems into low and high-level ones [Eq. (20)] 
acquires an additional physical sense: in up-to-date 
focal-plane arrays with photon sensitive elements 
under high-level operation mode, the photocharge 
integrated for the frame time overfills the ‘well’ of 
the CCD or MOS storage cell. In these cases, when 
designing electronic vision systems, special measures 
are being taken to restrict the irradiation or the 
photocharge. 

In summary: a limited storage cell does not make 
it possible to realize completely the capabilities of 
daylight and thermal vision systems [(1.8-2)Af] in 
information capacity and decreases this capacity to 
values of M [Eq. (31)]. As a result, in the value C, 
these systems approach night vision systems, for 
which C« (0.4-0.5)M [Eqs. (15) and (22)]. 

7. Electronic vision systems and detection 

Detecting systems are usually characterized by 
their sensitive parameters, in particular, the noise 
equivalent power (NEP). The requirements for a 
photodetector device when designing an electronic 
vision system based on the criterion of the achieve¬ 
ment of the maximum value of C turn out to be less 
demanding than those based on the criterion of the 
minimum NEP. That same logarithmic dependence, 
C~\nK, is ‘guilty’ here: it damps the losses in 
signal-to-noise ratio while those in NEP turn out to 
be exactly the same as those in K (NEP ~ 1 /K). In 
order for the maximum value K to be ensured in 
detecting systems, strict requirements have to be set 
for the quantum yield: in fact, its decrease from 1 to 
0.3 results in a change for the lowest NEP value by 
80% (NEP ~ 1//^= \/s\f03 = 1.83). It should 
be recalled that C gets lower by ca. 6 % only, in this 
case [see Eq. (21)]. 

In detecting systems, linear arrays tend to be 
substituted by 2D arrays: th eoretically , this improves 
NEP by a factor of {M e = v'l 00-500 = 10-24 [1,7]. 
In high-level systems, such a rather complicated 
substitution is not necessary [see Eq. (23)]. More¬ 
over, there are situations described above where the 
linear array considerably surpassed the 2D array. 

The spectral range is the third factor considered 
above. We have seen that the transition from the 
range 3-5 /xm to 8-14 /im almost does not increase 
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the information capacity although the effective num¬ 
ber of quanta rises by a factor of 85 (see Table 1). 
With the use of IR detecting systems: thermal search 
and tracking systems and thermal imagers, an object 
very weakly overheated against the background could 
be detected. Since K ~ with the said increase in 
the number of the effective quanta, the signal-to-noise 
ratio increases by a factor ]f&5 ~ 9, and as a conse¬ 
quence, the rise of temperature sensitivity, i.e., the 
minimum difference in temperature equivalent to the 
noise, can be expected to be ca. 9 also. However, 
precise calculation shows that this improvement is 
equal to a factor of 5.5 [6]. 

If both requirements, namely, to obtain the mini¬ 
mum NEP and the maximum C with the spatial 
resolution (or the photosensitive element area A c ) 
fixed, must be satisfied, then a photodetector device 
has to be designed following the stricter criterion of 
minimum NEP. 

The conclusions made above in this section were 
formulated for the detector device with unlimited 
capacitance of the storage cell. Taking into account 
the fact that this capacitance is finite, distinctions in 
approaches to designing of the focal-plane detector 
devices for vision and detecting systems disappear. 
So, we have seen that taking this into account the 
distinctions between spectral ranges is reduced: de¬ 
tector devices intended for 3-5 and 8-14 /tm ensure 
one and the same value of C. The electric capaci¬ 
tance of a unit cell in photon 2D arrays limits the 
ultimate accumulated charge, i 7 A e r ( iV e sin 2 ( j8/2) ^ 
A c n* [Eq. (30)], and hence it limits the maximum 
signal-to-noise ratio, Kk 0 <. '/n r . Therefore, it can 
be expected that the transition from range 3-5 /tm 
to that of 8-14 /Am will not result in any improve¬ 
ment of not only C but the temperature sensitivity as 
well. The precise calculation shows that such a 
transition with a limited capacitance of the storage 
cell even reduces the temperature sensitivity [6]. 


8. Quantity of information and features of objects 
for observation 

Up to now, we have conducted the comparison of 
the capabilities of electronic vision systems in infor¬ 
mation capacity, proceeding from the only condition: 
the photon exitance of each element of an object can 


vary from zero to the maximum value, N e . Below, 
the influence of some real features of objects and 
observation conditions on information capacity are 
discussed. 

The image in daylight and night vision systems 
being formed by the reflected external radiation, 
under definite conditions (diffusive reflection, homo¬ 
geneous irradiance, etc.), the exitance of an object, 
N(x,y), is linearly related to the spatial distribution 
of its reflectance averaged over the spectral range of 
vision system, R( ;t,y): 

N(x,y) = R(x,y)N e . (34) 

In thermal vision, the image is formed by the 
intrinsic radiant emission of an object first of all. 
Under some specific conditions (uniform tempera¬ 
ture of an object, a negligible small level of thermal 
contraradiation from the atmosphere and environ¬ 
mental objects [4]: these factors are analyzed below), 
the exitance N(x,y ) is linearly related to the com¬ 
plement of R, the emissivity e = 1 — R: 

N( x,y) = e( x,y)N e = N e -R( x,y)N e . (35) 

As is seen, in both cases under consideration, the 
information fields for the image and the reflectance 
coincide completely. In this case, the sensitivity of 
vision system to the contrast SR/R equals that to 
the contrast 8N t /N c . 

In real situations, the total information potential 
of vision systems often remains to non-used because 
the range Ae = A R is smaller than the total percent¬ 
age range. In addition to that, the contrast depends 
on the spectral range. So, it is known [4] that the 
mean value of reflectance for visible radiation from 
natural objects is 10.24%. In the infrared image of 
natural objects, contrasts usually decrease with in¬ 
creasing the wavelength. In particular, for the spec¬ 
tral range 3-5 /im, values e = 0.7-0.9 are typical 
and for that of 8-14 /im, e>0.9 [13]. For the 
calculation of the information capacity of sceneries 
with a limited range of changing A/?, Eq. (1) and all 
the expressions following from it should be modi¬ 
fied: A R should be entered under the logarithm 
symbol, i.e., it should be taken into account that the 
possible number of gradations in each element is 
now equal to A RK. According to these relationships 
and Table 1, the decrease in contrast when moving 
from the range 3-5 /im to that of 8-14 /im is 
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completely overcome by the increase of the intensity 
of the thermal radiant emission: Ae decreases by a 
factor of about 2 to 3 and becomes ~ 9 times 
larger. With the said range of variation of Ae= A R 
~ 0.1-0.3, the information capacity decreases by ca. 
5% to 10% as compared with the value calculated 
for the whole range, AR = Ae = 1. 

To some reduction of the contrast of thermal 
images, thermal contraradiation leads as well, in 
particular the contraradiation of the atmosphere con¬ 
tributing substantially to the thermal balance of the 
earth [4,7,8]. The reflected contraradiation of 
R(x,y)aN e flux density is added to the intrinsic 
thermal radiation of objects (the factor a shows what 
fraction of the object radiation amounts to the con¬ 
traradiation: under averaged conditions on the earth 
for the long-wavelength range, a * 0.8276). Adding 
the two components and taking into account the 
relationship R = 1 - e, we obtain for the thermal 
image: 

N(x,y) =e(x,y)N' + [l - e( jc,;y)]aN e 

= e(jt,y)(l -a)N e + aN e . (36) 

The contraradiation aN e leads to not only the 
appearance of the ‘pedestal’ but to the decrease of 
signal in the image conditioned by variations of e by 
a factor of 1/(1 - a) [4] [compare Eqs. (35) and 
(36)]. This is explained by the fact that both consid¬ 
ered components of radiation compensate for each 
other: in a ‘dark’ point, where the emissivity, e, is 
small, the share of intrinsic radiation is less, but the 
share of reflected radiation is higher because the 
reflectance, R = 1 — e, is greater in this case. In a 
‘light’ point, at a greater, e, the intrinsic radiation 
rises, but the reflected one decreases. Now, into Eq. 
(1) and the equations derived from it, the variation of 
not the ordinary contrast Ae = AR but the effective 
contrast (1 - a)Ae should be entered. 

In the limit when a approaches unity, both the 
radiations, intrinsic and contra, completely compen¬ 
sate for each other. With such a balance, the image 
stops to be dependent on e( x,y). Actually, it stops to 
be an ‘image’, turning, with lack of temperature 
contrast, into the homogeneous background, N(x,y ) 
« N e [Eq. (36)]. 

In Ref. [4], two typical situations were described: 
propitious and inconvenient for an observer. The 


propitious situation is the observation of the surface 
of the earth from above the atmosphere in the spec¬ 
tral range 8-14 /im with no clouds. In this case, the 
contraradiation of the atmosphere can be neglected 
and so, the contrast in an image practically com¬ 
pletely reproduces the contrast of objects in Ae, 
which was shown in the beginning of this section 
[Eq. (35)]. The other situation is the observation in 
the same spectral range of vertical objects situated 
near the surface of the earth when the contrast and, 
consequently, the number of distinguished gradations 
decrease by a factor of about 6, (1 - a v ) = 0.167 [4]. 
However, the additional losses in information capac¬ 
ity even in such an inconvenient case will be ca. 8% 
only, which is explained by ‘the logarithmic effect’, 
C~ log(l - a v ). 

Similarly, the dependence C on other known fac¬ 
tors that influence n e can be observed, namely, the 
absorption in the atmosphere, daily time, the season 
and the geographical situation. But the system re¬ 
mains a high-level one, the dependence C(n e ) will 
have been smoothed and the information capacity of 
one picture element will have been of the order of 
one byte. 

In the analysis of an image and the features of an 
object, not only quantity but also the character of 
information can be important. In the information 
massif C, the information is contained concerning 
both the number of picture elements M over the 
space R(x,y) and the number of the gradations of 
brightness (levels of contrast), K, in each element. 
The possibility of a considerable increase in C for 
vision systems on account of some increase of M 
was mentioned above. However, in a number of 
problems, the minimum possible contrast, SR/R, 
must be registered while the number of picture ele¬ 
ments, M, can be relatively small. In this case, it is 
purposeless to enhance the resolution of vision sys¬ 
tem over the required value M , but one should try to 
obtain the ultimate sensitivity: the system should be 
designed following the criterion NEP min but not the 

C max- 

In general, the number of picture elements M can 
be ‘changed’ by that of gradations K, simultane¬ 
ously increasing the sensitivity of a system to the 
contrast of irradiance, too. In order to perform this, 
the element area, A e , should be increased, then the 
value of K rises [Eq. (8)] and the number M falls: 



/./. Taubkin, MA. Trishenkov /Infrared Physics & Technology 37 (1996) 675-693 


687 


according to Eq. (3), with the frame area, A f , to be 
constant, we obtain M ~ 1 /A e . 

It is obvious that in the limit, both the information 
capacity and the NEP of a vision system are limited 
by the fluctuations, which cannot be removed in 
principle, of the flux of photons that form the optical 
image in this system. It is convenient for an ideal 
vision system to be characterized by the specific 
number, K *, of the gradations distinguished by it, 
bringing K to the unit values of A e , /,, sin 2 (/3/2), 
k 0 . Normalizing Eq. (8), we get, for tj= 1, F= 1: 


/ViSin 2 ()3/2) 




(37) 


The specific quantity K *, like the specific quanti¬ 
ties of the detectivity, D ’, and the noise equivalent 
temperature difference, NETD*, [4] are determined 
by features of an object and conditions of observa¬ 
tion. In the given case, K * is the ratio of the 
maximum specific signal from an object, N e , to its 
noise, -Jn^. This value, N t /{N~ t = is also called 
the dynamic or information potential of an image 
[3,4], As is seen, even an ideal electronic-optical 
system realizes the potential inherent in an object in 
some measure only. In this case, its efficiency, 
sin 2 ( >3/2) A e fi [Eq. (37)], is determined by the col¬ 
lection of radiation with the objective and the photo¬ 
sensitive areas as well as by the accumulation of 
photocharges in the unit cells. 

The result turned out to be interesting and unex¬ 
pected comparing diverse vision systems regarding 
the information potential K ’ when observing the 
moon. It is known 7% of the sun radiation incident is 
reflected from the moon, on average [7]. The rest of 
the incident sun energy is absorbed by the moon 
surface and then remitted in the infrared range. In 
this case, because of a low thermal conductance of 
the surface of the moon, the remission mainly occurs 
from the sunlit side, which is heated to 413 K [7]. 
Thus, the intrinsic long-wavelength radiation from 
the sunlit side of the moon is 93/7 « 13 times as 
large as the reflected sunlight in power and the 
number of IR photons is 13 X (5785 K/413 K) = 186 
times as large as that of short-wavelength photons. 
The contraradiation for the horizontal surfaces on the 
moon, which practically has no atmosphere, is close 
to zero. As a result, the information potential K * 


even of the sunlit side of the moon in the IR range 
turns out to be substantially greater than that in the 
visible range. 

The increase of the information potential of an 
optical image resulting from the transformation of 
radiant power into an long-wavelength range appears 
to be a paradoxical phenomenon: usually, any trans¬ 
formation is accompanied by loss of information. 
However, in the given case, the transformation un¬ 
dergoes a kind (a channel) of information transfer. 

Probably, the long-wavelength range surpasses 
the visible one in the potential K * also in remote 
optical sounding of the solar system planets. 

It was first shown in Ref. [4] that in the informa¬ 
tion potential of the image, thermal vision under the 
condition of thermal balance of the earth does not 
yield to the human daylight vision (‘the rule of 
information potentials’). The density of the quanta of 
long-wavelength radiation under the condition of 
thermal balance of the earth [7,8] is about T S /2T E 
times as high as that in the short-wavelength range: 
the factor of 1/2 is due to the fact that the sun 
quanta are absorbed only on the day illuminated 
hemisphere and the long-wavelength ones are emit¬ 
ted from the whole surface of the earth. Besides, in 
the range 8-14 fi m, the factor of the use of the 
long-wavelength-radiation quanta (0.230) is almost 3 
times as high as this factor is in the visible spectral 
range (0.082) [4]. 

However, owing to the addition of the intrinsic 
thermal radiation of vertical objects and the contrara¬ 
diation of both the atmosphere and the earth reflect¬ 
ing from them, the image in the spectral range 8-14 
/im is formed, as was mentioned above, only by the 
(1 — « v ) = 0.167 fraction of the quanta received. As 
a result of the mutual compensation for the above 
factors, the information potentials of visible and 
infrared images turn out to be equal. 

In this case, the known advantages of observation 
in the IR range are a weak dependence of the 
information potential on the time during the day (it 
should be reminded that the irradiance near the earth 
surface can change by 9-10 orders of magnitude 
within this time interval) as well as a smaller dissipa¬ 
tion of radiation in mists, fogs and under man-made 
hindrances. It is obvious that these advantages of the 
thermal vision are valid for the systems of daylight 
electronic vision as well. 
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The conclusion on the equality of the information 
potentials of visible and IR images under earth con¬ 
ditions is a principle one in our view: in the informa¬ 
tion volume, the thermal vision can reach the level of 
the daylight vision, which is known to ensure for 
people more than 80% of all the information re¬ 
ceived. Thus, ‘the rule of information potentials’, in 
a large measure, determines progress possibilities for 
practical thermal vision [4], 

The calculations in this paper including the 
derivation of Eqs. ( 8 ) and (37) a re perf ormed for the 
maximum level of noise, n„ = /Fr/n e , identical for 
all channels. Theoretically, the algorithm of the sys¬ 
tem can be complicated: one can determine ‘an 
individual’ noise for each channel depending on the 
sign al lev el in this channel, 17 /t, taking it to be equal 
to -jF-qn . It means that after the transition to the 
specific values and for tj = 1, F = 1, over a segment 
of AN length, AK * =d N/yfN gradations can be 
distinguished and the specific number of the grada¬ 
tions distinguished by, the system is: 

c n. d N — 

k '=Ltn~ 2 ^- (3S) 

Comparing Eqs. (37) and (38), one can see that 
the potential of the image, K *, with the mentioned 
adapted processing algorithm increases by a factor of 
two, i.e., as if the effective quantity of photons, N e , 
rises by a factor of four. It was repeatedly noted 
above that C very weakly depends on N e , from 
which follows that a similar complication of the 
algorithm for some increase in C to be obtained is an 
unlikely expedient. But concerning detecting sys¬ 
tems, where it is required to enhance K *, the 
above-described adapted algorithm is applicable. A 
more simple intermediate algorithm is used as well: 
for the ‘darkened’ fragment of a frame with a rela- 
t ively small level n' e , the identical value of noise, 
\jF-qn e , is assumed. Then the number of distin¬ 
gui shed gr adations in this fragment rises by a factor 
of ]jn e /n' e . 

Also other algorithms of interelement as well as 
interframe image processing are worth to consider 
additionally. 

In the general case, an image depends not only on 
R or e but on other characteristics of objects and 
observation conditions. For example, at a non-uni¬ 


form temperature, T, of the objects in a frame, onto 
their infrared image in thermal imagers conditioned 
by a variation of e(x,y), the image is put arising on 
account of a spatial variation of T. In the linearized 
model for small \T « T, 

AN 

&N(x,y) = Ae(x,y)(l - a)N + ——AT 
AT 

= Ae(x,y)(l -a)A e + £A77V e . (39) 

Here, £= (1 /N e )AN e /AT e ) is the relative tempera¬ 
ture factor of the thermal radiant emission in the 
range 0-A m . According to Ref. [ 6 ], it is equal to: 

f-^Um + I+O. (40) 


where x m = hc/kTX m and £ is some correction: 


1.6jc m + 4.8 
xl + 2x m + 2A' 


(41) 


In order for the calculations to be simplified, the 
correction £ can be assumed to be equal to its 
maximum value for the range A •« A m . So, at A m = 
14 /xm, it is 0.4. Then the error in calculation of £ 
from Eq. (40) will not exceed 2-3%. For the range 
0 - 3 °, one can take £ = 2 , this will result in an error 
of ~ 30%. For the range 8-14 /xm according to 
Eqs. (40) and (41), we will obtain £ = 0.017 K _ 1 . 

Eq. (36) makes it possible to evaluate the mini¬ 
mum value Ae min that an imager with known tem¬ 
perature sensitivity, NETD, can register: 

£ 

lAelmin = |AR| min = -A T = -—NETD 

1 - a v 1 - a v 


0.017X5(0.04-0.1) 

0.167 


= 0.02-0.05. 


(42) 


Here, A T = A 0 NETD was substituted. The numerical 
evaluation was done for the range 8-14 /xm, NETD 
= 0.04-0.1 K, and for the observation of small-size 
vertical objects situated near the earth surface [4], 
For the natural formations on the earth, the typical 
values of e being equal to =0.9-1 [13], the men¬ 
tioned thermal imager is able to distinguish two-five 
gradations Ac, A R under the conditions assumed. 

Thermal imagers with NETD > 0.1-0.2 K, under 
these conditions, do not perceive the picture condi- 
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tioned by the variations of the emissivities of small- 
size vertical objects and can be used for revealing 
somewhat higher or lower temperature of these ob¬ 
jects only [4]. Also shown in Ref. [14] is that the 
ultimate value of NETD for the ‘staring’ thermal 
imagers sensitive in the spectral range 8-14 ju,m is 
close to 10 “ 4 K. 

Thus, the information capacity in all the consid¬ 
ered situations can be taken to be a universal param¬ 
eter showing not only capabilities of a vision system 
but also a measure of proximity of a particular 
equipment to the theoretical limit. 


and, consequently, the logarithm itself tends to zero. 
In the second case, with A e —> *=, both the numerator 
(~ log/Aj) and the denominator (~ A e ) rise, but the 
former rises more slowly. And as a consequence of 
such a non-monotonic dependence, some optimum 
value opt A c has to exist for which the capacity C * 
is maximum. Having the derivative dC ' /d A l , being 
equal to zero, we arrive at the following value of 
In K : 


In tf=ln — sin|yjy jcpl A t T f N e =-. (44) 


9. Ultimate specific information capacity. Rule of 
binary channels 

Let us express the number of picture elements M 
in Eq. (1) in terms of the areas of an element, A e , 
and the frame, A f , (A/ = A f /A e ): 

C-0.415A log [i sin (|)^|^]. (43 ) 

The dependence of C on all the parameters of the 
problem is monotonic (except for the area of a 
picture element, A t ), and in order for C to be 
maximized, it is necessary to decrease the values of 
k 0 , F and to increase 17, t„ A e , j3. The ultimate 
values of these parameters were already presented 
above: k 0 = 5 (with a permissible error probability 
of ca. 10“ 9 ), t l = T f = 4 X 10 -2 s (for systems with 
the TV frame time); sin/3 = 0.45 (with D:f= 1:1); 
F = 1.37 (for idealized daylight- and thermal-vision 
systems) and F = 1 (for a night vision system); 
17 = 1; the values of N e are given in Table 1. These 
values will be used below in the calculation of the 
ultimate value of information capacity. The parame¬ 
ter A f can generally be eliminated considering the 
specific capacity (per unit of frame area), C" = 
C/A f , is introduced. 

Now it remains to consider the dependence C(A e ) 
and to determine the optimum area of the picture 
element, A e . 

According to Eq. (43), the information capacity C 
tends to zero with both a small element area and a 
large one. In the first case, the argument of the 
logarithm at some finite value of A e tends to one 


It follows from Eqs. (43) and (44), that the spe¬ 
cific information capacity of electronic imaging sys¬ 
tems, with the set value of N e , rises with the density 
of the optical channels (with a decrease in dimen¬ 
sions of the elements or with improvement in spatial 
resolution) up to where the number of gradations in 
each channel becomes about two and, hence, the 
capacity of each channel amounts to one bit. 

This result is important theoretically for multi¬ 
channel electro-optical systems of information trans¬ 
fer and can be formulated as ‘the rule of binary 
channels’: The information capacity of an electronic 
vision system is maximum with the ultimate spatial 
resolution, i.e., in the case when the amount of 
quanta that is necessary for one bit of information to 
be transferred is incident on one picture element for 
the frame time. 

As is shown below, ‘the rule of binary channels’ 
is valid under a relatively low irradiance when the 
radiation that is necessary for the transfer of one bit 
of information is incident on a photosensitive ele¬ 
ment with a size larger than the diffraction limit. 

In accordance with Eq. (44), the maximum of C * 
is attained with the area of the photosensitive ele¬ 
ment equal to 


optA e = (opt / e ) 2 = 


klFe 

T f N e sin 2 ( 13/ 2) 


8375 F 
V e 

(45) 


Here, e is the Napierian base. The condition of Eq. 
(45) determines the minimum quantity of detected 
photons, (n s ) min = k 2 0 Fe, as well. 
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For the systems of daylight and thermal vision 
with N t = (1-382)X 10 16 cm -2 s _l (see Table 1), 
we obtain: 

opt/ e = /optA e * 0.0005-0.1 pm (46) 

and for night vision with N e = N= 10" cm -2 s~\ 
opt/ e - ]/optA e - 2.9 pm. (47) 

The attainment of the extreme of C * (opt A e ) is 
not a success with a great flux of quanta: it is 
impossible to obtain so high resolution as 0.0005- 
0.01 pm. It is limited by the diffraction of the 
objective by the circle of confusion, / conf [15]: 

(48) 

Thus, for the calculation of the ultimate value of 
the specific capacity C *, we get two relationships. 
With a small flux of photons (for opt/ e > / conf ), into 
Eq. (43), A e = opt A e from Eq. (45) should be sub¬ 
stituted: 

C 1 0.09 N c 

C A f 16optA e ln2 optA e N * 
where 


l6k 2 0 F In 2 
T ( sin 2 ( /3/2) 


= (0.93-1.27) X 10 s <bytes~ l s~'. 

Hence for a night vision system with the typical 
value N e = 10" cm -2 s -1 , we obtain: 


C* = 10 6 bytes cm 2 . (50) 

Under a great flux of photons, A e * (/ con f) 2 from 
Eq. (48) should be substituted into Eq. (43): 


0.61 A 



(51) 


A definite difficulty in the use of the last relation¬ 
ship consists in the choice of the wavelength A for 
the calculation. The greatest value of C * appears to 
be attained with the infinitely wide spectral range, 
when N e = N. However, with A -> <», we get C ’ ~ 
1/A 2 —>0. Therefore, it is reasonable to determine 
the C* for real spectral ranges, assuming the long 


wavelength cutoff of this range, A m , to be as the 
value A calculated. Substituting the quantities A = 
A m , N e for the ranges 0.45-0.95, 3-5, and 8-14 
pm, according to Table 1, we will obtain: 

0.81-1.23 1 

C* =- - 2 - = ^7 ± 25%. (52) 

Now we explain the above-obtained Eqs. (49) and 
(52). As was already noted above, an electronic 
vision system, when forming an image signal, should 
transfer at least two brightness gradations, K > 2. 
Hence, the evaluation can be obtained for the mini¬ 
mum permissible signal-to-noise ratio in each chan¬ 
nel as well: for small, as compared with the resolu¬ 
tion of a system, objects, in accordance with Eq. ( 8 ), 
it should be not less than Kk 0 = 2X5= 10. The 
precise value of the signal-to-noise ratio is taken 
from Eq. (44). It happens to be close to the qualita¬ 
tive estimation: 

lnAT=^, K = e 0,5 = 1.65, 

2 

Kk 0 = e 0 5 X 5 = 8.25. (53) 

As is known, [1] for extended objects under visual 
observation or inside-frame summing up, k 0 de¬ 
creases to unity. 

The set signal-to-noise ratio can be ensured if the 
number of photons incident upon the photosensitive 
area for the frame time is not less than the above- 
calculated value (n s ) min = k\Fe [Eq. (45)]: this 
amounts to 70-90 photons per element for small 
objects and to 3-4 for extended ones. 

In Section 3 for the fixed area (A e = 30 X 30 
pm) and the typical values of other parameters of 
the system, the minimum density of quanta emitted 
by an object, N^ n [Eq. (19)], was calculated. Now, 
for the set density N e , the permissible size of the 
area, opt A e ~ 1 /N e [Eq. (45)], is found: for a smaller 
area, no required quantity of photons will be col¬ 
lected. In a frame of unit area (A f = 1 cm 2 ), M = 
l/optA e picture elements (channels) will be ar¬ 
ranged. With transferring two brightness gradations 
(K = 2), the capacity of one channel will amount to 
B = log 2 K = 1 bit and the capacity of all the chan¬ 
nels will be C* = M= l/optA e bits or C* =A //8 
= 0.125/opt A e ~N e bytes, being the qualitative 
derivation of Eq. (49). 
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According to Eqs. (45) and (49), with increasing 
the density of the flux N e , the required area of 
elements, A e , decreases and both the number of 
elements arranged on a unit of the frame area, M = 

1 /A c ~N e , and the specific information capacity, 
C* ~ 1 /A e ~ iV e , increase correspondingly. 

Following Eq. (46), at a very high flux, N t , the 
required signal-to-noise ratio is ensured even with 
very small sensitive areas of elements, which are less 
than the size of the circle of confusion. Then the area 
is limited by this size and A e ~ A 2 [this is a qualita¬ 
tive evaluation based on Eq. (48) with f:D ~ 1:1]. In 
the case under consideration, the number of the 
charge carriers accumulated in a channel and the 
number of brightness gradations are great, which is 
why the registration of the number K needs a capac¬ 
ity on the order of one byte. Since M - 1 /A e ~ 1/A 2 
picture elements (channels) go in the frame area 
equal to a unit of area (1 cm 2 ), the capacity of the 
system (with the capacity of a channel equal to one 
byte) does turn out to be equal to the number of 
channels, C* == Af = 1 /A 2 , being the physical 
essence of Eq. (52). 

With the use of Eqs. (49) and (52), the ultimate 
values of specific capacities for all the vision sys¬ 
tems under consideration were calculated, and are 
presented in Table 2. Also the sizes of the photosen¬ 
sitive elements determined from Eqs. (45) or (48) are 
given. As is seen from Table 2, the maximum spe¬ 
cific value C * is attained at a high level of irradia¬ 
tion and the shortest wavelength, i.e., in the daylight 
vision system. In a thermal imager for the range 3-5 
^m, it is ~ (5/0.9 ) 2 ~ 30 times worse and for 8-14 
/xm, it is lower by a factor of ~ 170 [the ratio of the 
wavelength squares amounts to ca. (14/0.9) 2 « 240 
times]. 

Note that being distinct in Eq. (52), in Eq. (49), 
C* does not depend on A. This leads the value N t 


corresponding to the boundary between the fields of 
application of Eqs. (49) and (52) to decrease with 
increasing wavelength of radiation. 

One more consequence follows from the relation¬ 
ship C* ~ 1/A 2 [Eq. (52)]: under the limitation of 
the sizes of photosensitive elements by diffraction 
and at D/f~ const, with increase of wavelength, 
the area of the objective, A ob (A ob ~ D 1 ~f 2 ~ A 2 
~C/C * = A f ), and the volume, V, of the optical 
system (V ~ D 2 f~ A 3 ) have to be increased. Only 
then, the same resolution capability will be ensured 
in long-wavelength systems with a somewhat greater 
value of K{K~{A e [Eq.( 8 )]}. 

Realization of the said high capabilities of the 
daylight vision system, i.e., the attainment of the 
values C* ~ 10 8 bytes/cm 2 , is rather problematical 
because it is problematical to produce cells with 
micron dimensions. And cell dimensions of 10 to 20 
/Am are attainable with the present day level of 
technology. Therefore, the value C* = 6.2X 10 5 
bytes/cm 2 given in Table 2 for the range 8-14 /Am 
is a real ‘landmark’ for developers of electronic 
vision systems for not only the long-wavelength 
spectral range but for the visible one as well. This is 
confirmed by the evaluation of C * for real photode¬ 
tector devices with such dimensions of unit cells. So, 
with A f =lXl cm 2 and \[A~ e = 17 /Am, we get 
Af = 580 X 580 = 3.4 X 10 5 and according to Eq. 
(31): 

CM 

C" = — = — = 3.4 X 10 5 bytes/cm 2 . (54) 

A f A f 

The double losses relative to the theoretical limit 
(C* « 6.2 X 10 5 bytes/cm 2 ) are due to a limited 
electric capacitance of a storage cell for the photon 
detectors. 


Table 2 

Ultimate values of specific information capacity 



System 




Daylight vision 

Night vision 

Thermal vision 


Spectral range (/Ain) 

0.45-0.95 

0.3-1.8 

3-5 

8-14 

N, N t (cm -2 s~ ') 

7.6 X 10 16 

~ 10" 

10 16 

8.5 X 10 17 

4 0 „f. 4 ( M®) 

1.1 

2.9 

6.1 

17.1 

C* (bytes/cm 2 frame) 

10 8 

~ 10 6 

3.3 X 10 6 

6.2 X 10 5 
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10. Electronic vision systems and the eye 

Now it is interesting to compare the considered 
electronic vision systems with the human vision, 
following the criterion of information capacity. Us¬ 
ing Eq. (1), we will find the value C for the eye 
under a daylight illumination and in the black-white 
approximation. In Ref. [4], on the base of the analy¬ 
sis of data published, the following parameters of the 
eye as an optical device were used: the view angle, 
12 °; the quantum efficiency, 17 = 0 . 01 ; the diameter 
of the cones, / A,~ ~ (1.5-6) X 10 -4 cm; the integra¬ 
tion time, fj = 0.1 s. Also the parameter N e changes 
relative to that for electronic vision systems: the flux 
of quanta is calculated for the photopic vision curve 
of the eye in the spectral range 0.4-0.7 /im and 
equals 2.33 X 10 16 cm -2 . Only the value k 0 = 5 
remains the same. According to Eq. (1), with the 
said values of the parameters, we obtain: 

C = 0.415 M log X 0.1025 X (2-6) 10 " 4 
X / ^j-2.33 X 10 16 " ] = (1.37-1.57) M. 

(55) 

This relationship places the eye into one line with 
high-level electronic vision systems: if the systems 
with an identical number of picture elements, M, are 
compared, then the eye surpasses the systems with 
real photodetector devices in information capacity by 
30% to 60% [Eq. (31)], but it yields to a hypothetical 
thermal imager with an ideal detector device by ca. 
20% to 30% [Eq. (18)]. If the number of picture 
elements for the eye is taken to be equal to the 
number of the nervous fibres going out of the eye, 
ca. 10 6 , then in information capacity, with the eye, 
the high-resolution systems level if their format is 
M > 1000 X 1000. Scanning systems based on linear 
arrays with M e = 2000 to 5000 have a value C as 
high as to surpass the eye. 

The ultimate value of the information capacity of 
the image and its dependence on A [Eq. (52)] ob¬ 
tained above are one more argument in the discus¬ 
sion on whether the spectral ranges of the human 
vision and the vision of animals are optimum. For¬ 
merly, it was assumed that these visions are not 


thermal ones, at least, because detectors for the 8-14 
/itm range had to operate at a low temperature and 
nature ‘has not been able’ to create any cooled eye. 
But it was shown later that at room temperature, in 
principle, radiation could be registered with a sensi¬ 
tivity close to the radiant limit. Theoretically, ther¬ 
mal photodetectors in particular, are able to do this 
[16]. Thus, the spectral range of sensitivity of the eye 
can hardly be related to problems of cooling. From 
the point of view of the results obtained in this 
paper, the expedience of vision in the spectral range 
0.4-0.7 /xm can be explained by a high specific 
information capacity. The diameter of the cones in 
the human eye (to 1.5-2 /rm [13]) is close enough to 
the diffraction limit [Eq. (48)]. In this case, the value 
C * in the visible range turns out to be higher by two 
orders of magnitude than that in the IR range (8-14 
jum) [Eq. (53)]. The gain in minimizing the volume 
of an optical system turns out to be still larger, at 
least by one order. 

Fragmentary processing of the photosignals from 
the cones in the ganglion cells does not appear to 
lead to any essential decrease in information capacity 
of the vision [Eq. (38)] and the combination of the 
rods into groups ensures the necessary increase of 
photosensitive areas under a low illumination [Eq. 
(45)]. Lastly, some features the vision agree with the 
two-gradation mode ensuring the maximum value of 
specific information capacity [Eqs. (43) and (44)]. 

So, the universal principle of nature: to arrive at 
the decisions optimum from the point of view of 
physics is confirmed again. 


11. Conclusion 

(1) The information capacity of electronic vision 
systems, C, is first of all determined by the number 
of picture elements, M, and under a considerable 
level of irradiation weakly depends on type of a 
system (daylight or thermal vision), spectral range 
(for example, 3-5 or 8-14 /im), quantum yield 
(intrinsic, extrinsic, or Schottky-barrier photodetec¬ 
tors), on detector configuration (linear or 2D array), 
and on its other parameters within the real limits of 
variation. With typical parameters of high-level-vi¬ 
sion systems, their ultimate information capacity 
conditioned by the fluctuations of the flux of the 
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photons forming an optical image is calculated ac¬ 
cording to the simplest formula: C = (l-2)Af 
bytes/frame. Higher values of the factor at M~ 
(1.8-2) correspond to an ideal photodetector device 
and lower ones (~ 1) correspond to a real photon 
detector device, in which the principal cause of the 
decrease of C is a limited electric capacitance of the 
storage cell. 

Somewhat smaller is the information capacity in a 
low-level system of night vision: C = (0.4-0.5)Af 
bytes/frame. 

(2) In the information potential, thermal vision 
does not yield to daylight vision under averaged 
conditions of thermal balance on the surface of the 
earth, when contraradiation leads to a decrease of 
contrast in the infrared image. This conclusion deter¬ 
mines the progress possibilities of practical thermal 
vision arising now owing to the use of two-dimen¬ 
sional array detectors. 

(3) The ultimate specific information capacity C* 
of electronic vision systems under a low level of 
irradiation rises with the density of optical channels 
up to the value for which the number of gradations 
in each channel becomes about two (‘the rule of 
binary channels’)- In this case, the maximum value 
of C * turns out to be proportional to the number of 
quanta emitted and reflected from a unit of area of 
an object, N e , in the spectral range of sensitivity of 
the detector device and with typical parameters of a 
system amounts to C* = N t /N * where N ’ = 10 5 
bytes -1 s -1 . 

Under high irradiation conditions, the specific 
information capacity of electronic vision systems, 
theoretically, is limited by diffraction. In this case, 
the frame area should be covered by squares of 
A X A size (A is the wavelength). And the number of 
these squares will be, with an error of less than 25%, 
the ultimate information capacity expressed in bytes 
per frame and 1 cm 2 of image area. 

(4) Nature created the eye to be optimum in 
information capacity, giving it the spectral range 
0.4-0.7 /cm: here, in the ultimate case, the specific 
information capacity is two orders of magnitude 


higher and the minimum volume of the optical sys¬ 
tem is three orders smaller than those for infrared 
images. 
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